Week 2 - Discussion Questions

These are example discussion points for you to think about before class. You are not expected to engage with all of them — pick the ones that speak most directly to your own research, and bring two or three rough answers to the in-class session. The full description of how to use these pages, including what the question tags mean, is on the Week 1 Discussion page.

Sub-lessons

Fine-Tuning, RLHF and Alignment

Calibrate Pick a recent behaviour you have noticed in a chat model — a refusal, a flash of sycophancy, a particular tone, a hedged answer to a controversial question. Place it most plausibly in the SFT, RLHF, or post-deployment guardrail layer. What evidence would push it to a different layer?
Apply For research use specifically, which alignment stage matters most for the quality of the AI assistance you get? Be concrete about a research task you actually do and which behavioural property (helpfulness, calibration, refusal patterns) you would want shaped by which stage.
Critical The lesson notes that labs are deliberately secretive about post-training. Which claims about model behaviour can you still make confidently without knowing the details, and where do you have to stop short and label the inference as speculative?
Connect Week 1 introduced LLMs and transformers as “attention machines” that learn from text. This lesson reframes them as systems that have been deliberately shaped by post-training. Which of the two framings is closer to the “mental model” you reach for when you sit down to use Claude or ChatGPT, and how does using both at once change what you would and would not delegate to the tool?

How AI Image Generation Works

Calibrate After watching the video, write a one-sentence explanation of what a diffusion model actually does. Test the sentence against a recent image-generation model card or paper from your own field. Where does the sentence hold, and where does it stretch?
Apply For a researcher in your field, sketch one concrete legitimate use of image generation in your work, and one use that would cross into the territory of fabricating data. What is the principle that separates them?
Critical The video explains image generation through visual analogy. Which of the underlying issues — training-data provenance, copyright, evaluation, downstream harm — does the pedagogical framing under-discuss, and what would have been lost if it had spent more time there?
Connect Image-generation models work differently from LLMs but face many of the same training-data, alignment, and evaluation questions. Pick two lessons from the LLM material this week that transfer cleanly to image models, and one that doesn't.

LLM Architecture Deep Dive

Calibrate The lesson covers positional encoding (RoPE, ALiBi, etc.). Pick one concrete behaviour you have seen in a long-context LLM — performance degrading after some token count, lost-in-the-middle effects, sudden incoherence at the boundary — and trace it to a specific architectural choice. What evidence would confirm or refute the trace?
Apply As a researcher, when does it matter that you understand the transformer architecture in detail, and when can you treat the model as a black box? Sketch a working decision rule for yourself.
Critical The lesson covers attention, positional encoding, and tokenisation as if they were a coherent family, but the actual implementations vary substantially across model families. Where can you safely generalise across families, and where should you stop?
Connect Week 1 introduced transformers via 3Blue1Brown's visual analogy. Now that you have read the architectural deep dive, where does the visual analogy still hold and where does it break? Which framing — the visual one or the architectural one — does more work for you when you have to explain why a model behaves the way it does?

Training Large Language Models

Calibrate The lesson introduces scaling laws as a way to predict model performance from compute, data, and parameters. Pick a recent model release and try to place its actual benchmark performance against what a scaling-laws prediction would have said. Where do they agree, and where do they diverge?
Apply For your own research, when does training or fine-tuning a model from scratch make more sense than using a pre-trained API? Sketch the data, scale, and infrastructure conditions that would make that decision sensible — and the conditions under which it never would.
Critical The lesson covers data curation, scaling laws, and optimisation as if they were largely settled techniques. Which of the three is most likely to be substantially rewritten in the next two years, and what would such a rewrite mean for research planning you do now?
Connect Week 1's history-of-AI lesson covered earlier “data eras” (toy domains, hand-curated corpora, web crawls). Place the present scale-and-curation picture in that arc. Where does the present moment look like an extension of a long-running trend, and where does it look like a discontinuity?